Corpus: fra-ch_web_2011_10K


4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of distinct word N-grams (N = 1 to 5) at sentence endings, counted over the first K sentences

K        # of words  # of bigrams  # of trigrams  # of 4-grams  # of 5-grams
100              96            99             99            99            99
1000            919           977            995           998           998
10000          6705          8993           9788          9958          9980
100000         6705          8994           9789          9959          9981
1000000        6705          8994           9789          9959          9981
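
The counts in the table can be approximated with a short script. The following is only a minimal sketch, not the tool that produced this page: the function name `ending_ngram_counts` is hypothetical, tokens are assumed to be whitespace-separated, and sentences shorter than N tokens are assumed to contribute no N-gram. The original tokenization and punctuation handling are not documented here and may differ.

```python
def ending_ngram_counts(sentences, k, max_n=5):
    """Count distinct sentence-final word N-grams over the first k sentences.

    Returns a list whose entry at index n-1 is the number of distinct
    N-grams of length n found at sentence endings.
    Assumption: tokens are separated by whitespace.
    """
    # Slicing past the end of the list is safe; it simply returns everything.
    tokenized = [s.split() for s in sentences[:k]]
    counts = []
    for n in range(1, max_n + 1):
        # Assumption: sentences with fewer than n tokens are skipped.
        endings = {tuple(toks[-n:]) for toks in tokenized if len(toks) >= n}
        counts.append(len(endings))
    return counts


if __name__ == "__main__":
    demo = [
        "the cat sat down",
        "a dog sat down",
        "birds fly",
        "the cat sat down",
    ]
    for k in (2, 4, 100):
        print(k, ending_ngram_counts(demo, k))
```

In this sketch, once K exceeds the number of available sentences the counts stop changing. That behaviour would be consistent with the identical rows for K = 100000 and K = 1000000 in the table above.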


Zipf's diagram for sentence endings


[Figure: Gnuplot diagram]
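
The data behind a Zipf diagram of this kind is a list of (rank, frequency) pairs, usually plotted on log-log axes. As a hedged illustration, and assuming the same whitespace tokenization as above, the pairs for sentence-final N-grams could be derived as follows. The function name `zipf_ranks` is hypothetical, and this is not the procedure used to produce the diagram on this page.

```python
from collections import Counter


def zipf_ranks(sentences, n=1):
    """Return (rank, frequency) pairs for sentence-final N-grams.

    Rank 1 is the most frequent ending. Assumption: tokens are
    whitespace-separated and sentences shorter than n are skipped.
    """
    tokenized = (s.split() for s in sentences)
    freq = Counter(tuple(toks[-n:]) for toks in tokenized if len(toks) >= n)
    # most_common() sorts by descending frequency, which gives the ranking.
    return [
        (rank, count)
        for rank, (_, count) in enumerate(freq.most_common(), start=1)
    ]


if __name__ == "__main__":
    demo = [
        "the cat sat down",
        "a dog sat down",
        "birds fly",
        "the cat sat down",
    ]
    # Print one "rank frequency" pair per line, a layout that
    # plotting tools such as gnuplot can read as two columns.
    for rank, count in zipf_ranks(demo):
        print(rank, count)
```

Under Zipf's law the frequency is roughly proportional to 1/rank, so the points would fall near a straight line when both axes are logarithmic.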

Generated in 1411 ms on 2023-11-09 at 02:04.